Exploring Newspapers: a Case Study in Corpus Analysis
Authors
Abstract
This paper details the analysis of text data taken from a leading Scottish newspaper. The purpose of the work is twofold. First, we report on the approach adopted and the application of simple corpus analysis tools and techniques. Second, we report some of the findings from our analysis of this particular newspaper corpus.
Similar resources
A Contrastive Analysis of Sports Headlines in Two English Newspapers
It holds true that a flourishing field of Contrastive Rhetoric (CR) research has begun to address the way various text types and/or genres may differ across cultures and languages (Corner, 1996). Very much in line with this development, this study was an attempt to characterize the linguistic structures of headlines in the sports section of 2 English newspapers: one non-Iranian (The Times) and one ...
Vocabulary Lists for EAP and Conversation Students
Despite the abundance of research investigating general and academic vocabularies and developing dozens of word lists, few studies have compared academic vocabulary with general service word lists such as conversation vocabulary. Many EAP researchers assume that university students need to know all the words in West’s (1953) General Service List (GSL) as a prerequisite to academic words (e.g., ...
A System for Identifying and Exploring Text Repetition in Large Historical Document Corpora
We present software for retrieving and exploring duplicated text passages in low-quality OCR historical text corpora. The system combines NCBI BLAST, software created for comparing and aligning biological sequences, with the Solr search and indexing engine, providing a web interface to easily query and browse the clusters of duplicated texts. We demonstrate the system on a corpus of scanned...
The Presence and Influence of English in the Portuguese Financial Media
As the lingua franca of the 21st century, English has become the main language for intercultural communication for those wanting to embrace globalization. In Portugal, it is the second language of most public and private domains influencing its culture and discourses. Language contact situations transform languages by the incorporations they make from other languages and Portugal has...
Building and annotating a corpus for the study of journalistic text reuse
In this paper we present the METER Corpus, a novel resource for the study and analysis of journalistic text reuse. The corpus consists of a set of news stories written by the Press Association (PA), the major UK news agency, and a set of stories about the same news events, as published in various British newspapers. In some cases the newspaper stories are rewritten from the PA source; in other ...